Computing Linear Discriminants for Idiomatic Sentence Detection

نویسندگان

  • Jing Peng
  • Anna Feldman
  • Laura Street
چکیده

In this paper, we describe the binary classification of sentences into idiomatic and non-idiomatic. Our idiom detection algorithm is based on linear discriminant analysis (LDA). To obtain a discriminant subspace, we train our model on a small number of randomly selected idiomatic and non-idiomatic sentences. We then project both the training and the test data on the chosen subspace and use the three nearest neighbor (3NN) classifier to obtain accuracy. The proposed approach is more general than the previous algorithms for idiom detection — neither does it rely on target idiom types, lexicons, or large manually annotated corpora, nor does it limit the search space by a particular linguistic con-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Carrier detection in hemophilia A: a cooperative international study. II. The efficacy of a universal discriminant.

Factor VIII (F.VIII) and von Willebrand factor (VWF):Ag data collected by eight laboratories on a total of 336 obligatory carriers of hemophilia A and 137 normal women were used to answer several questions concerning the construction of linear discriminants for carrier detection. It was found: that a "universal" linear discriminant can be constructed which is suitable for use in all laboratorie...

متن کامل

Idiomatic versus Literal Interpretations Ditropically Ambiguous Sentences Of

"Ditropically" ambiguous sentences (each having both a literal and an idiomatic interpretation) were prepared tbr listener's discrimination judgments, and fbr silent readers' rankings on an "idiomaticity'" scale. Listeners were unable to discriminate the literal from the idiomatic versions when presented with randomized single sentences excised from paragraph contexts. There was a bias toward i...

متن کامل

Detecting Japanese idioms with a linguistically rich dictionary

Detecting idioms in a sentence is important to sentence understanding. This paper discusses the linguistic knowledge for idiom detection. The challenges are that idioms can be ambiguous between literal and idiomatic meanings, and that they can be “transformed” when expressed in a sentence. However, there has been little research on Japanese idiom detection with its ambiguity and transformations...

متن کامل

Is the comprehension of idiomatic sentences indeed impaired in paranoid Schizophrenia? A window into semantic processing deficits

Schizophrenia patients have been reported to be more impaired in comprehending non-literal than literal language since early studies on proverbs. Preference for literal rather than figurative interpretations continues to be documented. The main aim of this study was to establish whether patients are indeed able to use combinatorial semantic processing to comprehend literal sentences and both co...

متن کامل

Lexical or syntactic control of sentence formulation? Structural generalizations from idiom production.

To compare abstract structural and lexicalist accounts of syntactic processes in sentence formulation, we examined the effectiveness of nonidiomatic and idiomatic phrasal verbs in inducing structural generalizations. Three experiments made use of a syntactic priming paradigm in which participants recalled sentences they had read in rapid serial visual presentation. Prime and target sentences co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010